Heuristic discretization method for Bayesian Networks

نویسندگان

  • Mariana D. C. Lima
  • Silvia M. Nassar
  • Pedro Ivo R. B. G. Rodrigues
  • Paulo José de Freitas Filho
  • Carlos M. C. Jacinto
چکیده

Bayesian Network (BN) is a classification technique widely used in Artificial Intelligence. Its structure is a Direct Acyclic Graph (DAG) used to model the association of categorical variables. However, in cases where the variables are numerical, a previous discretization is necessary. Discretization methods are usually based on a statistical approach using the data distribution, such as division by quartiles. In this article we present a discretization using a heuristic that identifies events called peak and valley. Genetic Algorithm was used to identify these events having the minimization of the error between the estimated average for BN and the actual value of the numeric variable output as the objective function. The BN has been modeled from a database of Bit’s Rate of Penetration of the Brazilian pre-salt layer with 5 numerical variables and one categorical variable, using the proposed discretization and the division of the data by the quartiles. The results show that the proposed heuristic discretization has higher accuracy than the quartiles discretization.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Discretizing Continuous Attributes While Learning Bayesian Networks

We introduce a method for learning Bayesian networks that handles the discretization of continuous variables as an integral part of the learning process. The main ingredient in this method is a new metric based on the Minimal Description Length principle for choosing the threshold values for the discretization while learning the Bayesian network structure. This score balances the complexity of ...

متن کامل

Mix-nets: Factored Mixtures of Gaussians in Bayesian Networks with Mixed Continuous And Discrete Variables

Recently developed techniques have made it possible to quickly learn accurate probability density functions from data in low-dimensional continuous spaces. In particular, mixtures of Gaussians can be fitted to data very quickly using an accelerated EM algorithm that employs multiresolution d-trees (Moore, 1999). In this paper, we propose a kind of Bayesian network in which low-dimensional mixtu...

متن کامل

Using Bayesian networks for bankruptcy prediction: Some methodological issues

This study provides operational guidance for using naïve Bayes Bayesian network (BN) models in bankruptcy prediction. First, we suggest a heuristic method that guides the selection of bankruptcy predictors from a pool of potential variables. The method is based upon the assumption that the joint distribution of the variables is multivariate normal. Variables are selected based upon correlations...

متن کامل

A Novel Discretization for Parameter Learning in Bayesian Network using Dynamic Programming

In AI and machine learning techniques such as decision trees and Bayesian networks, there is a growing need for converting continuous data into discrete form. Several approaches are available for discretization, however finding an appropriate and efficient discretization method is a challenging task. In this paper, we present an impurity based dynamic multi-interval discretization approach for ...

متن کامل

A Multivariate Discretization Method for Learning Bayesian Networks from Mixed Data

In this paper we address the problem of discretization in the context of learning Bayesian networks (BNs) from data con­ taining both continuous and discrete vari­ ables. We describe a new technique for multivariate discretization, whereby each continuous variable is discretized while tak­ ing into account its interaction with the other variables. The technique is based on the use of a Bayesian...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • JCS

دوره 10  شماره 

صفحات  -

تاریخ انتشار 2014